Information Status Distinctions and Referring Expressions: An Empirical Study of References to People in News Summaries

نویسندگان

  • Advaith Siddharthan
  • Ani Nenkova
  • Kathleen McKeown
چکیده

Although there has been much theoretical work on using various information status distinctions to explain the form of references in written text, there have been few studies that attempt to automatically learn these distinctions for generating references in the context of computer-regenerated text. In this article, we present a model for generating references to people in news summaries that incorporates insights from both theory and a corpus analysis of human written summaries. In particular, our model captures how two properties of a person referred to in the summary—familiarity to the reader and global salience in the news story—affect the content and form of the initial reference to that person in a summary. We demonstrate that these two distinctions can be learned from a typical input for multi-document summarization and that they can be used to make regeneration decisions that improve the quality of extractive summaries.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatically Learning Cognitive Status for Multi-Document Summarization of Newswire

Machine summaries can be improved by using knowledge about the cognitive status of news article referents. In this paper, we present an approach to automatically acquiring distinctions in cognitive status using machine learning over the forms of referring expressions appearing in the input. We focus on modeling references to people, both because news often revolve around people and because exis...

متن کامل

Associations between the Empirical Dietary Inflammatory Index and Cognitive Function Status in Community-Dwelling Elderly People of Tehran, Iran

Background and Objectives: Inflammation plays important roles in development of several chronic diseases, including cognitive functions. Neuritis in the brain can lead to decreased cognitive function in elderly people. Diet is one of the factors affecting inflammation. The empirical dietary inflammatory index is a novel tool that assesses the overall inflammatory potential of diets by generatin...

متن کامل

Automatically Acquiring Fine-Grained Information Status Distinctions in German

We present a model for automatically predicting information status labels for German referring expressions. We train a CRF on manually annotated phrases, and predict a fine-grained set of labels. We achieve an accuracy score of 69.56% on our most detailed label set, 76.62% when gold standard coreference is available.

متن کامل

(Un)Translatability of Persian Idiomatic Expressions to English in Political Discourse

The present study sought to investigate the extent to which Persian idiomatic expressions would influence the western translators' strategies in providing the ultimate product in English, and it also attempted to uncover the underlying assumptions in target text, then to suggest some weighty strategies to overcome difficulties with translation. For this purpose, the data was analyzed within the...

متن کامل

Survey of companions of cancer patients about the need and how to express getting incurable cancer

Introduction: Expressing bad news in medicine is one of the most important measures taken by medical staff that should be given to patients in special circumstances that it is necessary to examine the views of companions and patients in this regard. Therefore, the aim of this study was to investigate the necessity and manner of expressing bad news (incurable cancer) from the perspective of canc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Computational Linguistics

دوره 37  شماره 

صفحات  -

تاریخ انتشار 2011